Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction
نویسندگان
چکیده
As many acoustic signal processing methods, for example for source separation or noise canceling, operate in the magnitude spectrogram domain, the problem of reconstructing a perceptually good sounding signal from a modified magnitude spectrogram, and more generally to understand what makes a spectrogram consistent, is very important. In this article, we derive the constraints which a set of complex numbers must verify to be a consistent STFT spectrogram, i.e. to be the STFT spectrogram of a real signal, and describe how they lead to an objective function measuring the consistency of a set of complex numbers as a spectrogram. We then present a flexible phase reconstruction algorithm based on a local approximation of the consistency constraints, explain its relation with phase-coherence conditions devised as necessary for a good perceptual sound quality, and derive a real-time time scale modification algorithm based on sliding-block analysis. Finally, we show how inconsistency can be used to develop a spectrogram-based audio encryption scheme.
منابع مشابه
Spectrogram consistency and its application to phase reconstruction
In this article, we derive the constraints which a set of complex numbers must verify to be a consistent STFT spectrogram, i.e., to be the STFT spectrogram of an actual real-valued signal, and describe how they lead to an objective function measuring the consistency of a set of complex numbers as a spectrogram. We then present a flexible phase reconstruction algorithm based on a local approxima...
متن کاملFast Signal Reconstruction from Magnitude Stft Spectrogram Based on Spectrogram Consistency
The modification of magnitude spectrograms is at the core of many audio signal processing methods, from source separation to sound modification or noise canceling, and reconstructing a natural sounding signal in such situations is thus a very important issue. This article presents recent theoretical and experimental developments on the application to signal reconstruction from a modified magnit...
متن کاملConsistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency
Wiener filtering is one of the most widely used methods in audio source separation. It is often applied on time-frequency representations of signals, such as the short-time Fourier transform (STFT), to exploit their short-term stationarity, but so far the design of the Wiener time-frequency mask did not take into account the necessity for the output spectrograms to be consistent, i.e., to corre...
متن کاملGenerative Adversarial Network-Based Postfilter for STFT Spectrograms
We propose a learning-based postfilter to reconstruct the high-fidelity spectral texture in short-term Fourier transform (STFT) spectrograms. In speech-processing systems, such as speech synthesis, conversion, enhancement, separation, and coding, STFT spectrograms have been widely used as key acoustic representations. In these tasks, we normally need to precisely generate or predict the represe...
متن کاملCorrelation of STFT Spectrograms Applied to the Voice Deal Function in Mobile Phones
For the past five decades there has been a substantial increase in the development of computer applications based on the STFT (Short-Time Fourier Transform) spectrograms. Amongst others, a relevant example of this kind of implementation is speech analysis. Indeed, if the audio signal is correctly filtered, it is possible to use STFT spectrogram techniques to recognize speech. This work investig...
متن کامل